Overview
Brought to you by YData
Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 16270 |
| Missing cells | 45187 |
| Missing cells (%) | 18.5% |
| Duplicate rows | 542 |
| Duplicate rows (%) | 3.3% |
| Total size in memory | 2.5 MiB |
| Average record size in memory | 160.5 B |
Variable types
| Categorical | 4 |
|---|---|
| Numeric | 11 |
| Dataset has 542 (3.3%) duplicate rows | Duplicates |
current_attendance_in_any_education_instituition has 418 (2.6%) missing values | Missing |
highest_level_of_education has 775 (4.8%) missing values | Missing |
main_activity_engaged_in has 2133 (13.1%) missing values | Missing |
main_occupation has 9902 (60.9%) missing values | Missing |
daily_wage_owner_or_not has 10090 (62.0%) missing values | Missing |
employment_status_of_the_main_occupation has 9902 (60.9%) missing values | Missing |
member_went_out_for_work_or_not_during_last_week has 11967 (73.6%) missing values | Missing |
highest_level_of_education has 216 (1.3%) zeros | Zeros |
no_of_hours_stayed_at_home_during_last_week has 699 (4.3%) zeros | Zeros |
Reproduction
| Analysis started | 2024-11-18 08:36:49.218635 |
|---|---|
| Analysis finished | 2024-11-18 08:37:01.677034 |
| Duration | 12.46 seconds |
| Software version | ydata-profiling vv4.11.0 |
| Download configuration | config.json |
Variables
member_ID
Categorical
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 770.3 KiB |
| I_1 | |
|---|---|
| I_2 | |
| I_3 | |
| I_4 | |
| I_5 | |
| Other values (8) |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.0027658 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | I_1 |
|---|---|
| 2nd row | I_2 |
| 3rd row | I_3 |
| 4th row | I_4 |
| 5th row | I_1 |
Common Values
| Value | Count | Frequency (%) |
| I_1 | 4063 | |
| I_2 | 3877 | |
| I_3 | 3275 | |
| I_4 | 2457 | |
| I_5 | 1443 | 8.9% |
| I_6 | 671 | 4.1% |
| I_7 | 264 | 1.6% |
| I_8 | 120 | 0.7% |
| I_9 | 55 | 0.3% |
| I_10 | 26 | 0.2% |
| Other values (3) | 19 | 0.1% |
Length
| Value | Count | Frequency (%) |
| i_1 | 4063 | |
| i_2 | 3877 | |
| i_3 | 3275 | |
| i_4 | 2457 | |
| i_5 | 1443 | 8.9% |
| i_6 | 671 | 4.1% |
| i_7 | 264 | 1.6% |
| i_8 | 120 | 0.7% |
| i_9 | 55 | 0.3% |
| i_10 | 26 | 0.2% |
| Other values (3) | 19 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 16270 | |
| _ | 16270 | |
| 1 | 4119 | 8.4% |
| 2 | 3883 | 7.9% |
| 3 | 3277 | 6.7% |
| 4 | 2457 | 5.0% |
| 5 | 1443 | 3.0% |
| 6 | 671 | 1.4% |
| 7 | 264 | 0.5% |
| 8 | 120 | 0.2% |
| Other values (2) | 81 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 48855 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| I | 16270 | |
| _ | 16270 | |
| 1 | 4119 | 8.4% |
| 2 | 3883 | 7.9% |
| 3 | 3277 | 6.7% |
| 4 | 2457 | 5.0% |
| 5 | 1443 | 3.0% |
| 6 | 671 | 1.4% |
| 7 | 264 | 0.5% |
| 8 | 120 | 0.2% |
| Other values (2) | 81 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 48855 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| I | 16270 | |
| _ | 16270 | |
| 1 | 4119 | 8.4% |
| 2 | 3883 | 7.9% |
| 3 | 3277 | 6.7% |
| 4 | 2457 | 5.0% |
| 5 | 1443 | 3.0% |
| 6 | 671 | 1.4% |
| 7 | 264 | 0.5% |
| 8 | 120 | 0.2% |
| Other values (2) | 81 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 48855 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| I | 16270 | |
| _ | 16270 | |
| 1 | 4119 | 8.4% |
| 2 | 3883 | 7.9% |
| 3 | 3277 | 6.7% |
| 4 | 2457 | 5.0% |
| 5 | 1443 | 3.0% |
| 6 | 671 | 1.4% |
| 7 | 264 | 0.5% |
| 8 | 120 | 0.2% |
| Other values (2) | 81 | 0.2% |
age
Real number (ℝ)
| Distinct | 97 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.39287 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 149 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 19 |
| median | 38 |
| Q3 | 56 |
| 95-th percentile | 75 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 37 |
Descriptive statistics
| Standard deviation | 22.075172 |
|---|---|
| Coefficient of variation (CV) | 0.57498103 |
| Kurtosis | -0.99725102 |
| Mean | 38.39287 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | 0.14195376 |
| Sum | 624652 |
| Variance | 487.31322 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19 | 282 | 1.7% |
| 17 | 278 | 1.7% |
| 23 | 276 | 1.7% |
| 15 | 267 | 1.6% |
| 18 | 266 | 1.6% |
| 20 | 263 | 1.6% |
| 16 | 256 | 1.6% |
| 42 | 252 | 1.5% |
| 22 | 251 | 1.5% |
| 45 | 245 | 1.5% |
| Other values (87) | 13634 |
| Value | Count | Frequency (%) |
| 0 | 149 | |
| 1 | 131 | |
| 2 | 138 | |
| 3 | 186 | |
| 4 | 171 | |
| 5 | 177 | |
| 6 | 163 | |
| 7 | 164 | |
| 8 | 196 | |
| 9 | 198 |
| Value | Count | Frequency (%) |
| 98 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 95 | 3 | < 0.1% |
| 93 | 8 | < 0.1% |
| 92 | 5 | < 0.1% |
| 91 | 7 | < 0.1% |
| 90 | 17 | |
| 89 | 20 | |
| 88 | 16 | |
| 87 | 14 |
relationship_to_the_head_of_household
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.8720344 |
| Minimum | 1 |
|---|---|
| Maximum | 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 75 |
| Maximum | 109 |
| Range | 108 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 20.1174 |
|---|---|
| Coefficient of variation (CV) | 2.5555529 |
| Kurtosis | 11.327254 |
| Mean | 7.8720344 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.5888571 |
| Sum | 128078 |
| Variance | 404.70979 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 5654 | |
| 1 | 4012 | |
| 2 | 3226 | |
| 5 | 1198 | 7.4% |
| 75 | 685 | 4.2% |
| 6 | 666 | 4.1% |
| 4 | 434 | 2.7% |
| 97 | 237 | 1.5% |
| 86 | 101 | 0.6% |
| 109 | 52 | 0.3% |
| Other values (2) | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 4012 | |
| 2 | 3226 | |
| 3 | 5654 | |
| 4 | 434 | 2.7% |
| 5 | 1198 | 7.4% |
| 6 | 666 | 4.1% |
| 12 | 3 | < 0.1% |
| 75 | 685 | 4.2% |
| 86 | 101 | 0.6% |
| 88 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 109 | 52 | 0.3% |
| 97 | 237 | 1.5% |
| 88 | 2 | < 0.1% |
| 86 | 101 | 0.6% |
| 75 | 685 | 4.2% |
| 12 | 3 | < 0.1% |
| 6 | 666 | 4.1% |
| 5 | 1198 | 7.4% |
| 4 | 434 | 2.7% |
| 3 | 5654 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 8386 | |
| 0 | 7884 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 8386 | |
| 0 | 7884 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 8386 | |
| 0 | 7884 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 16270 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 8386 | |
| 0 | 7884 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 16270 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 8386 | |
| 0 | 7884 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 16270 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 8386 | |
| 0 | 7884 |
ethnicity
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4357714 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.0512815 |
|---|---|
| Coefficient of variation (CV) | 0.73220675 |
| Kurtosis | 4.8198478 |
| Mean | 1.4357714 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.326844 |
| Sum | 23360 |
| Variance | 1.1051928 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 13560 | |
| 4 | 1968 | 12.1% |
| 2 | 572 | 3.5% |
| 3 | 82 | 0.5% |
| 6 | 42 | 0.3% |
| 5 | 32 | 0.2% |
| 9 | 14 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 13560 | |
| 2 | 572 | 3.5% |
| 3 | 82 | 0.5% |
| 4 | 1968 | 12.1% |
| 5 | 32 | 0.2% |
| 6 | 42 | 0.3% |
| 9 | 14 | 0.1% |
| Value | Count | Frequency (%) |
| 9 | 14 | 0.1% |
| 6 | 42 | 0.3% |
| 5 | 32 | 0.2% |
| 4 | 1968 | 12.1% |
| 3 | 82 | 0.5% |
| 2 | 572 | 3.5% |
| 1 | 13560 |
religion
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8679164 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.306145 |
|---|---|
| Coefficient of variation (CV) | 0.69925236 |
| Kurtosis | -0.14726716 |
| Mean | 1.8679164 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.0956388 |
| Sum | 30391 |
| Variance | 1.7060147 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 10807 | |
| 4 | 2442 | 15.0% |
| 3 | 2053 | 12.6% |
| 5 | 547 | 3.4% |
| 2 | 407 | 2.5% |
| 9 | 8 | < 0.1% |
| 6 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 10807 | |
| 2 | 407 | 2.5% |
| 3 | 2053 | 12.6% |
| 4 | 2442 | 15.0% |
| 5 | 547 | 3.4% |
| 6 | 6 | < 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 8 | < 0.1% |
| 6 | 6 | < 0.1% |
| 5 | 547 | 3.4% |
| 4 | 2442 | 15.0% |
| 3 | 2053 | 12.6% |
| 2 | 407 | 2.5% |
| 1 | 10807 |
marital_status
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9127843 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1643284 |
|---|---|
| Coefficient of variation (CV) | 0.60870866 |
| Kurtosis | 13.576298 |
| Mean | 1.9127843 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.9990392 |
| Sum | 31121 |
| Variance | 1.3556605 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 7417 | |
| 1 | 6330 | |
| 3 | 1311 | 8.1% |
| 4 | 893 | 5.5% |
| 9 | 138 | 0.8% |
| 7 | 64 | 0.4% |
| 8 | 52 | 0.3% |
| 5 | 44 | 0.3% |
| 6 | 21 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 6330 | |
| 2 | 7417 | |
| 3 | 1311 | 8.1% |
| 4 | 893 | 5.5% |
| 5 | 44 | 0.3% |
| 6 | 21 | 0.1% |
| 7 | 64 | 0.4% |
| 8 | 52 | 0.3% |
| 9 | 138 | 0.8% |
| Value | Count | Frequency (%) |
| 9 | 138 | 0.8% |
| 8 | 52 | 0.3% |
| 7 | 64 | 0.4% |
| 6 | 21 | 0.1% |
| 5 | 44 | 0.3% |
| 4 | 893 | 5.5% |
| 3 | 1311 | 8.1% |
| 2 | 7417 | |
| 1 | 6330 |
current_attendance_in_any_education_instituition
Real number (ℝ)
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 418 |
| Missing (%) | 2.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.4575448 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 8 |
| Q3 | 8 |
| 95-th percentile | 8 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.5556012 |
|---|---|
| Coefficient of variation (CV) | 0.39575432 |
| Kurtosis | -0.64997855 |
| Mean | 6.4575448 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -1.1196853 |
| Sum | 102365 |
| Variance | 6.5310976 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 11435 | |
| 2 | 2964 | 18.2% |
| 3 | 560 | 3.4% |
| 1 | 298 | 1.8% |
| 4 | 233 | 1.4% |
| 5 | 191 | 1.2% |
| 6 | 105 | 0.6% |
| 7 | 66 | 0.4% |
| (Missing) | 418 | 2.6% |
| Value | Count | Frequency (%) |
| 1 | 298 | 1.8% |
| 2 | 2964 | 18.2% |
| 3 | 560 | 3.4% |
| 4 | 233 | 1.4% |
| 5 | 191 | 1.2% |
| 6 | 105 | 0.6% |
| 7 | 66 | 0.4% |
| 8 | 11435 |
| Value | Count | Frequency (%) |
| 8 | 11435 | |
| 7 | 66 | 0.4% |
| 6 | 105 | 0.6% |
| 5 | 191 | 1.2% |
| 4 | 233 | 1.4% |
| 3 | 560 | 3.4% |
| 2 | 2964 | 18.2% |
| 1 | 298 | 1.8% |
highest_level_of_education
Real number (ℝ)
Missing  Zeros 
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 775 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.294095 |
| Minimum | 0 |
|---|---|
| Maximum | 19 |
| Zeros | 216 |
| Zeros (%) | 1.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 10 |
| median | 12 |
| Q3 | 14 |
| 95-th percentile | 16 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.6555127 |
|---|---|
| Coefficient of variation (CV) | 0.32366584 |
| Kurtosis | 0.90851411 |
| Mean | 11.294095 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -1.0071066 |
| Sum | 175002 |
| Variance | 13.362773 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 2989 | |
| 12 | 2697 | |
| 11 | 2484 | |
| 16 | 1333 | |
| 13 | 1219 | |
| 9 | 772 | 4.7% |
| 10 | 638 | 3.9% |
| 6 | 567 | 3.5% |
| 8 | 509 | 3.1% |
| 7 | 407 | 2.5% |
| Other values (10) | 1880 | |
| (Missing) | 775 | 4.8% |
| Value | Count | Frequency (%) |
| 0 | 216 | 1.3% |
| 1 | 143 | 0.9% |
| 2 | 161 | 1.0% |
| 3 | 233 | 1.4% |
| 4 | 277 | 1.7% |
| 5 | 327 | |
| 6 | 567 | |
| 7 | 407 | |
| 8 | 509 | |
| 9 | 772 |
| Value | Count | Frequency (%) |
| 19 | 42 | 0.3% |
| 18 | 84 | 0.5% |
| 17 | 235 | 1.4% |
| 16 | 1333 | |
| 15 | 162 | 1.0% |
| 14 | 2989 | |
| 13 | 1219 | |
| 12 | 2697 | |
| 11 | 2484 | |
| 10 | 638 | 3.9% |
main_activity_engaged_in
Real number (ℝ)
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2133 |
| Missing (%) | 13.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5640518 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.210986 |
|---|---|
| Coefficient of variation (CV) | 0.70353846 |
| Kurtosis | -1.7877888 |
| Mean | 4.5640518 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.030884242 |
| Sum | 64522 |
| Variance | 10.310431 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 5473 | |
| 7 | 3465 | |
| 8 | 2513 | |
| 9 | 859 | 5.3% |
| 2 | 707 | 4.3% |
| 4 | 364 | 2.2% |
| 5 | 266 | 1.6% |
| 3 | 215 | 1.3% |
| 6 | 159 | 1.0% |
| 10 | 116 | 0.7% |
| (Missing) | 2133 | 13.1% |
| Value | Count | Frequency (%) |
| 1 | 5473 | |
| 2 | 707 | 4.3% |
| 3 | 215 | 1.3% |
| 4 | 364 | 2.2% |
| 5 | 266 | 1.6% |
| 6 | 159 | 1.0% |
| 7 | 3465 | |
| 8 | 2513 | |
| 9 | 859 | 5.3% |
| 10 | 116 | 0.7% |
| Value | Count | Frequency (%) |
| 10 | 116 | 0.7% |
| 9 | 859 | 5.3% |
| 8 | 2513 | |
| 7 | 3465 | |
| 6 | 159 | 1.0% |
| 5 | 266 | 1.6% |
| 4 | 364 | 2.2% |
| 3 | 215 | 1.3% |
| 2 | 707 | 4.3% |
| 1 | 5473 |
main_occupation
Real number (ℝ)
Missing 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 9902 |
| Missing (%) | 60.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8406093 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 5 |
| Q3 | 6 |
| 95-th percentile | 11 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 10.312635 |
|---|---|
| Coefficient of variation (CV) | 1.7656779 |
| Kurtosis | 72.386715 |
| Mean | 5.8406093 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 8.3275904 |
| Sum | 37193 |
| Variance | 106.35043 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 1777 | 10.9% |
| 2 | 1272 | 7.8% |
| 9 | 605 | 3.7% |
| 4 | 527 | 3.2% |
| 3 | 501 | 3.1% |
| 1 | 469 | 2.9% |
| 6 | 333 | 2.0% |
| 11 | 296 | 1.8% |
| 7 | 271 | 1.7% |
| 8 | 245 | 1.5% |
| (Missing) | 9902 |
| Value | Count | Frequency (%) |
| 1 | 469 | 2.9% |
| 2 | 1272 | |
| 3 | 501 | 3.1% |
| 4 | 527 | 3.2% |
| 5 | 1777 | |
| 6 | 333 | 2.0% |
| 7 | 271 | 1.7% |
| 8 | 245 | 1.5% |
| 9 | 605 | 3.7% |
| 11 | 296 | 1.8% |
| Value | Count | Frequency (%) |
| 99 | 72 | 0.4% |
| 11 | 296 | 1.8% |
| 9 | 605 | 3.7% |
| 8 | 245 | 1.5% |
| 7 | 271 | 1.7% |
| 6 | 333 | 2.0% |
| 5 | 1777 | |
| 4 | 527 | 3.2% |
| 3 | 501 | 3.1% |
| 2 | 1272 |
daily_wage_owner_or_not
Categorical
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10090 |
| Missing (%) | 62.0% |
| Memory size | 770.3 KiB |
| 2.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 4064 | |
| 1.0 | 2116 | 13.0% |
| (Missing) | 10090 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2.0 | 4064 | |
| 1.0 | 2116 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 6180 | |
| 0 | 6180 | |
| 2 | 4064 | |
| 1 | 2116 | 11.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 18540 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 6180 | |
| 0 | 6180 | |
| 2 | 4064 | |
| 1 | 2116 | 11.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 18540 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 6180 | |
| 0 | 6180 | |
| 2 | 4064 | |
| 1 | 2116 | 11.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 18540 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 6180 | |
| 0 | 6180 | |
| 2 | 4064 | |
| 1 | 2116 | 11.4% |
employment_status_of_the_main_occupation
Real number (ℝ)
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 9902 |
| Missing (%) | 60.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.182946 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.2221368 |
|---|---|
| Coefficient of variation (CV) | 0.38396404 |
| Kurtosis | -0.079889298 |
| Mean | 3.182946 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.079925575 |
| Sum | 20269 |
| Variance | 1.4936183 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 3698 | 22.7% |
| 5 | 1084 | 6.7% |
| 1 | 859 | 5.3% |
| 4 | 415 | 2.6% |
| 2 | 159 | 1.0% |
| 6 | 153 | 0.9% |
| (Missing) | 9902 |
| Value | Count | Frequency (%) |
| 1 | 859 | 5.3% |
| 2 | 159 | 1.0% |
| 3 | 3698 | |
| 4 | 415 | 2.6% |
| 5 | 1084 | 6.7% |
| 6 | 153 | 0.9% |
| Value | Count | Frequency (%) |
| 6 | 153 | 0.9% |
| 5 | 1084 | 6.7% |
| 4 | 415 | 2.6% |
| 3 | 3698 | |
| 2 | 159 | 1.0% |
| 1 | 859 | 5.3% |
no_of_hours_stayed_at_home_during_last_week
Real number (ℝ)
Zeros 
| Distinct | 326 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 124.84774 |
| Minimum | 0 |
|---|---|
| Maximum | 168 |
| Zeros | 699 |
| Zeros (%) | 4.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 770.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 96 |
| median | 140 |
| Q3 | 168 |
| 95-th percentile | 168 |
| Maximum | 168 |
| Range | 168 |
| Interquartile range (IQR) | 72 |
Descriptive statistics
| Standard deviation | 47.768165 |
|---|---|
| Coefficient of variation (CV) | 0.38261138 |
| Kurtosis | 0.29131003 |
| Mean | 124.84774 |
| Median Absolute Deviation (MAD) | 28 |
| Skewness | -1.0383694 |
| Sum | 2031272.7 |
| Variance | 2281.7976 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 168 | 5380 | |
| 84 | 761 | 4.7% |
| 0 | 699 | 4.3% |
| 160 | 449 | 2.8% |
| 150 | 439 | 2.7% |
| 120 | 422 | 2.6% |
| 140 | 360 | 2.2% |
| 100 | 333 | 2.0% |
| 108 | 301 | 1.9% |
| 96 | 282 | 1.7% |
| Other values (316) | 6844 |
| Value | Count | Frequency (%) |
| 0 | 699 | |
| 0.142 | 1 | < 0.1% |
| 0.147 | 1 | < 0.1% |
| 0.159 | 1 | < 0.1% |
| 0.168 | 1 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.3 | 1 | < 0.1% |
| 1 | 13 | 0.1% |
| 2 | 13 | 0.1% |
| 2.3 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 168 | 5380 | |
| 167.5 | 1 | < 0.1% |
| 167.3 | 1 | < 0.1% |
| 167.25 | 1 | < 0.1% |
| 167 | 36 | 0.2% |
| 166.5 | 1 | < 0.1% |
| 166 | 87 | 0.5% |
| 165.9 | 1 | < 0.1% |
| 165.75 | 1 | < 0.1% |
| 165.7 | 1 | < 0.1% |
member_went_out_for_work_or_not_during_last_week
Categorical
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11967 |
| Missing (%) | 73.6% |
| Memory size | 770.3 KiB |
| 1.0 | |
|---|---|
| 3.0 | |
| 2.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 2.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 2764 | 17.0% |
| 3.0 | 857 | 5.3% |
| 2.0 | 682 | 4.2% |
| (Missing) | 11967 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 2764 | |
| 3.0 | 857 | 19.9% |
| 2.0 | 682 | 15.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4303 | |
| 0 | 4303 | |
| 1 | 2764 | |
| 3 | 857 | 6.6% |
| 2 | 682 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12909 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 4303 | |
| 0 | 4303 | |
| 1 | 2764 | |
| 3 | 857 | 6.6% |
| 2 | 682 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12909 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 4303 | |
| 0 | 4303 | |
| 1 | 2764 | |
| 3 | 857 | 6.6% |
| 2 | 682 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12909 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 4303 | |
| 0 | 4303 | |
| 1 | 2764 | |
| 3 | 857 | 6.6% |
| 2 | 682 | 5.3% |
Interactions
Missing values
Sample
| member_ID | age | relationship_to_the_head_of_household | gender | ethnicity | religion | marital_status | current_attendance_in_any_education_instituition | highest_level_of_education | main_activity_engaged_in | main_occupation | daily_wage_owner_or_not | employment_status_of_the_main_occupation | no_of_hours_stayed_at_home_during_last_week | member_went_out_for_work_or_not_during_last_week | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| household_ID | |||||||||||||||
| ID0001 | I_1 | 71 | 1 | 0 | 1 | 1 | 2 | 8.0 | 14.0 | 2.0 | 99.0 | 2.0 | 1.0 | 168.0 | 3.0 |
| ID0001 | I_2 | 66 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN |
| ID0001 | I_3 | 32 | 3 | 0 | 1 | 1 | 2 | 8.0 | 17.0 | 1.0 | 2.0 | 2.0 | 1.0 | 70.0 | 1.0 |
| ID0001 | I_4 | 30 | 4 | 1 | 1 | 1 | 2 | 8.0 | 17.0 | 1.0 | 2.0 | 2.0 | 1.0 | 150.0 | 1.0 |
| ID0002 | I_1 | 85 | 1 | 0 | 1 | 1 | 2 | 8.0 | 7.0 | 4.0 | NaN | NaN | NaN | 168.0 | NaN |
| ID0002 | I_2 | 66 | 5 | 0 | 1 | 1 | 2 | 8.0 | 14.0 | 2.0 | 7.0 | 2.0 | 3.0 | 0.0 | 2.0 |
| ID0002 | I_3 | 59 | 3 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 2.0 | 4.0 | 2.0 | 1.0 | 168.0 | 3.0 |
| ID0003 | I_1 | 44 | 1 | 0 | 1 | 1 | 2 | 8.0 | 16.0 | 2.0 | 2.0 | 2.0 | 1.0 | 100.0 | 1.0 |
| ID0003 | I_2 | 41 | 2 | 1 | 1 | 1 | 2 | 8.0 | 17.0 | 2.0 | 2.0 | 2.0 | 1.0 | 100.0 | 1.0 |
| ID0003 | I_3 | 74 | 5 | 1 | 1 | 1 | 4 | 8.0 | 16.0 | 4.0 | NaN | NaN | NaN | 168.0 | NaN |
| member_ID | age | relationship_to_the_head_of_household | gender | ethnicity | religion | marital_status | current_attendance_in_any_education_instituition | highest_level_of_education | main_activity_engaged_in | main_occupation | daily_wage_owner_or_not | employment_status_of_the_main_occupation | no_of_hours_stayed_at_home_during_last_week | member_went_out_for_work_or_not_during_last_week | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| household_ID | |||||||||||||||
| ID4060 | I_1 | 78 | 1 | 1 | 1 | 1 | 1 | 8.0 | 11.0 | 7.0 | NaN | NaN | NaN | 130.0 | NaN |
| ID4061 | I_1 | 82 | 1 | 1 | 1 | 1 | 4 | 8.0 | 12.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN |
| ID4061 | I_2 | 53 | 3 | 0 | 1 | 1 | 1 | 8.0 | 12.0 | 1.0 | 9.0 | 1.0 | 3.0 | 70.0 | NaN |
| ID4062 | I_1 | 73 | 1 | 0 | 1 | 1 | 2 | 8.0 | 12.0 | 1.0 | 5.0 | 1.0 | 5.0 | 168.0 | NaN |
| ID4062 | I_2 | 66 | 2 | 1 | 1 | 1 | 2 | 8.0 | 12.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN |
| ID4063 | I_1 | 62 | 1 | 1 | 1 | 1 | 4 | 8.0 | 11.0 | 7.0 | NaN | NaN | NaN | 48.0 | NaN |
| ID4063 | I_2 | 49 | 4 | 0 | 1 | 1 | 2 | 8.0 | 11.0 | 1.0 | 5.0 | 1.0 | 3.0 | 120.0 | NaN |
| ID4063 | I_3 | 42 | 3 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 48.0 | NaN |
| ID4063 | I_4 | 37 | 3 | 0 | 1 | 1 | 1 | 8.0 | 11.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN |
| ID4063 | I_5 | 36 | 3 | 0 | 1 | 1 | 1 | 8.0 | 11.0 | 1.0 | 3.0 | 2.0 | 3.0 | 0.0 | NaN |
Duplicate rows
Most frequently occurring
| member_ID | age | relationship_to_the_head_of_household | gender | ethnicity | religion | marital_status | current_attendance_in_any_education_instituition | highest_level_of_education | main_activity_engaged_in | main_occupation | daily_wage_owner_or_not | employment_status_of_the_main_occupation | no_of_hours_stayed_at_home_during_last_week | member_went_out_for_work_or_not_during_last_week | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 103 | I_2 | 42 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 9 |
| 163 | I_2 | 51 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 9 |
| 356 | I_4 | 0 | 3 | 1 | 1 | 1 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | 168.0 | NaN | 9 |
| 130 | I_2 | 46 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 8 |
| 245 | I_2 | 67 | 2 | 1 | 1 | 1 | 2 | 8.0 | 12.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 8 |
| 261 | I_3 | 1 | 3 | 0 | 1 | 1 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | 168.0 | NaN | 8 |
| 366 | I_4 | 2 | 3 | 1 | 1 | 1 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | 168.0 | NaN | 8 |
| 79 | I_2 | 38 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 7 |
| 146 | I_2 | 48 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 7 |
| 167 | I_2 | 52 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 7 |